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Abstract 

Semigraphoids are combinatorial structures that arise in statistical learning the- 
ory. They are equivalent to convex rank tests and to polyhedral fans that coarsen 
the reflection arrangement of the symmetric group S n . In this paper we resolve two 
problems on semigraphoids posed in Studeny's book ^H]) and we answer a related 
question by Postnikov, Reiner, and Williams on generalized permutohedra |17j . We 
also study the semigroup and the toric ideal associated with semigraphoids. 

1 Introduction 

A conditional independence ( CI) statement on a finite set of random variables, in- 
dexed by [n] = {1,2, ...,n}, is a formal symbol [i Jl_ j \ K ] where K C [n] and 
i,j £ [n]\A. The symbol [i _LL j \ K] represents the statement that the random 
variables i and j are conditionally independent given the joint random variable K. 
For any joint probability distribution on the n random variables, the set M. of all 
CI statements that are valid for the given distribution satisfies the following axiom: 

(SG) If [tJL j \KUi] and [iALi \K\ are in M then so are [iALj \K] and [iALi \KUj]. 

A semigraphoid is any set M. of CI statements which satisfies the axiom (SG). Stu- 
deny's book |18j gives an introduction to semigraphoids and their role in statistical 
learning theory. For further details and references see also Matiis [111 I13j . In this 
paper we construct examples which answer two problems stated by Studeny: 
(Ql) Is it true that every coatom of the lattice of (disjoint) semigraphoids over [n] is 
a structural independence model over [n] ? Question 4, page 194] 

(Q2) Is every structural imset over [n] already a combinatorial imset over [n] ? 

18 Question 7, page 207] 

Our approach is based on the geometric characterization of semigraphoids which 
was developed in [Ej. Let Il n _i denote the (n — l)-dimensional permutohedron 
and let C n = [0, l] n denote the standard n-dimensional cube. The vertices 
of Il ra _i are in bijection with the elements of the symmetric group S n , and with 
the monotone edge paths from (0, 0, ... , 0) to (1,1,..., 1) on the cube C n . The 
2-dimensional faces of C n are in bijection with the CI statements on [n]. Namely, 
[i_LLj | isT] = [j J_Lz | K] represents the 2-face of C n with = 1 for k G K and xi = 
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for I € [n]\(K U {i, j}). The number of these 2-cubes equals <y n := Q)2 n ~ 2 . There 
is a natural surjection from the edges of H n -i onto the 2-faces of C n . Namely, an 
edge of n n _i corresponds to a pair of adjacent monotone edge paths on C n . These 
adjacent paths differ only along a 2-cube [i AL j \K]. In this manner, we identify 
any set A4 of CI statements on [n] with a set of 2-cubes on the boundary of C n . We 
also identify M with a set of edges of the permutohedron II n _i, bearing in mind 
that opposite edges of a square have the same CI statement as their label. 

Each 2-face of the permutohedron II n _i is either a square or a hexagon. By |15j . 
the semigraphoid axiom is equivalent to the following geometric condition on II n _i: 
(SG ) If two adjacent edges of a hexagon are in A4 then so are their two opposites. 
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The normal fan of the permutohedron II n _x is the reflection arrangement of S n . 
Theorem 3 in identifies semigraphoids with fans that coarsen this arrangement. 
Such fans are called convex rank tests. Namely, M. specifies the set of edges of II n _i 
whose dual walls in the normal fan are not present in the convex rank test. 

A basic question about any semigraphoid M is whether its corresponding convex 
rank test is submodular, in other words, whether it is the normal fan of a convex 
polytope. That polytope would then be a Minkowski summand of II n _i. These 
polytopes are known as generalized permutohedra and they were studied in |161 117j . 

Studeny's first question has the following geometric translations: 
(Ql) Is every coarsest convex rank test submodular? 

(Ql) Is every fan which maximally coarsens the S n - arrangement the normal fan of 
a generalized permutohedron? 

In the first version of ^7j, Postnikov, Reiner and Williams asked a similar question: 
(Q3) Is every simplicial fan which coarsens the S n - arrangement the normal fan of 
a simple generalized permutohedron? 

This paper answers all three questions. In Section 2 we derive and explain our 
counterexample for Question (Q3). That example is discussed in |T3 Example 3.8]. 
By Studeny's classification of the 26424 semigraphoids for n = 4, it had been known 
that the answers to Questions (Ql) and (Q2) are affirmative for n < 4. In Sections 
3 and 4 we construct counterexamples for (Ql) and (Q2) with n = 5. 

Question (Q2) has the following reformulation in the setting of toric algebra |14| 
§7]. We represent the semigraphoid axiom as an equation among formal symbols: 

(SG") [iALj\Ku£] + [iAL£\K] = [iALj \K] + [iALi \KUj] 

for all i,j, I, K. These relations span the kernel of the linear map 

A : V n -> Z 2 " , [iALj \ K] h-> e iK + e jK - e K - e ijK - (1) 

A semigraphoid is a solution to the equations (SG") in the semiring {0,+}, rep- 
resenting "zero" and "positive". A semigraphoid is submodular if it is the set of 
zero coordinates of a solution to (SG") in the non-negative real numbers. These 
definitions furnish us with an algebraic representation of a semigraphoid M. and a 
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systematic method for testing submodularity of Jv[ by linear programming. Stu- 
deny's question (Q2) concerns the N-linear span of the columns of the matrix ^4: 
(Q2) Is the semigroup .4(N 7n ) normal, i.e., does it coincide with ,4(R>q) n Z 2 " ? 

In Section 5 we study the toric ideal of A in a polynomial ring in j n un- 
knowns, and we examine how it differs from the subideal generated by the binomials 

(SG W ) [iALj \K\j£] • [iAL£\K] - [iALj \K] ■ [iAL£\KUj\. 

ProDosition l5.1l describes the primary decomposition of this binomial ideal for n = 4. 
We also discuss the problem of deriving the full Markov basis from (SG W ). 



2 A non-submodular simplicial semigraphoid 

Let n = 4 and consider the 4-dimensional cube C4 and the 3-dimensional permuto- 
hedron LT3. Each hexagon on II3 corresponds to one of the eight facets of C4. Each 
facet specifies three semigraphoid axioms, written additively as in (SG"): 
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This is a system of 24 equations in 74 = 24 formal symbols [i-LLj | K\. 

A semigraphoid is a solution to these equations over the semiring {0, +}. More 
precisely, given such a solution vector in {0, +} 24 , the semigraphoid M consists of all 
coordinates [iAL j \ K] that have the value 0. There are 26424 such semigraphoids. 
They form a sublattice of the Boolean lattice {0, +} 24 , with + < 0. Question (Ql) 
concerns the coatoms of this lattice. But let us first resolve Question (Q3). 

We consider the following collection of CI statements: 

M = {[2_U_3|14], [1JL4|23], [1 _U_ 2 | 0], [3iL4|0]}. (2) 

These four symbols are highlighted in the 24 equations above by the use of double 
brackets [ ••• ]. Each equation (individually) can be solved among the positive 
reals after these four symbols have been set to zero, or equivalently they can be 
solved as a system over {0, +}. This shows that M is a semigraphoid. 



4|1|23 114123 




2|3|14 

Figure 1: A simple 3- dimensional polytope with 16 vertices and 10 facets 

The semigraphoid M is represented geometrically by the three-dimensional poly- 
tope in Figure 1. This polytope is simple, i.e., each of the 16 vertices is adjacent 
to three other vertices. The eight vertices whose labels include three bars (such as 
4|2|1|3) correspond to unique permutations in S4 (namely the permutation 4213), 
while the eight vertices whose labels have two bars (such as 4|1|23) correspond to 
pairs of permutations in S4 (namely 4123 and 4132). This partition of S4 into eight 
singletons and eight pairs is the convex rank test of M . The normal fan of the poly- 
tope in Figure 1 is a simplicial fan which is combinatorially (but not geometrically) 
isomorphic to a fan that coarsens the hyperplane arrangement of £4. 

Proposition 2.1. The simplicial semigraphoid M. is not submodular. 
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Proof. Suppose that M. were submodular. Then the above equations have a solution 
in (R>o) 24 whose coordinates in A4 are zero and whose other 20 coordinates are 
positive. The four equations marked by an "<^^" give the following four equations: 

[2_U_3|1] = [2_LL3|0] + [1_LL2|3] 

[1_U_4|3] = [1_U_4|0] + [3JL4|1] 

[3_LL4|1] = [2_LL3|1] + [3_U_4|12] 

[1_U_2|3] = [ 1_LL4 1 3] + [1_LL2 1 34] . 

Adding the left hand sides and the right hand sides of the four equations yields 

[2_LL3|0] + [1_LL4|0] + [3_LL4|12] + [1_LL2|34] = 0. 

This contradicts the assumption that these four values are strictly positive. □ 

The set of all non- negative solutions to the 24 equations is an 11-dimensional 
cone in (M>o) 24 . This cone is isomorphic to the 16-dimensional cone of submodular 
functions on 2^, modulo its 5-dimensional lineality space. Its 22108 faces are in 
bijection with the submodular semigraphoids, or, equivalently, with the generalized 
permutohedra for n = 4. In addition to these, there are 4316 semigraphoids that are 
not submodular. Each of the latter can be represented by a polytope of dimension 
< 3 as in Figure 1. These polytopes have the combinatorial properties of generalized 
permutohedra, but they cannot be realized as Minkowski summands of II3. For 
example, see [HH Figure 5] for a polytope that depicts Studeny's example of a 
semigraphoid that is not submodular (see jTS] and Section 2.2.4]). 

We now give a classification of non-submodular semigraphoids for n = 4 and \M 
small. All simplicial examples are coarsenings (up to relabeling) of the particular 
semigraphoid A4 in Proposition 12.11 The following table lists the number of semi- 
graphoids classified by number of CI statements, their type, and whether they are 
simplicial. Here, the type of a semigraphoid is the triple (mo, mi, 171%) where m% is 
the number of CI statements [i-LLj | K] in M such that \K\ = nit- 
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\M\ 


type 
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simplicial 


total 
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3 A non-submodular coarsest semigraphoid 

We now consider the case n = 5. There are 75 = 80 CI statements, one for each 
two-dimensional face of the 5-cube C5. There are 120 semigraphoid axioms (SG"), 
three for each of the 40 three-dimensional faces of C5 , listed as additive equations in 
the Appendix. The semigraphoids are the solutions of these equations over {0, +} 80 . 
These solutions include the all-zero vector which represents the semigraphoid that 
consists of all 80 CI statements, and which is the maximal element in the lattice of 
semigraphoids. A semigraphoid is said to be coarsest if it is maximal among non- 
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semigraphoids. Geometrically, such a semigraphoid corresponds to a fan which 
coarsens the S^-arrangement but cannot be coarsened to a non-trivial fan. 

We now present the counterexample which answers question (Ql). Our con- 
structions make use of the identification of semigraphoids with convex rank tests 
that was derived in ^S] . Let T denote the partition of the symmetric group S$ into 
fourteen classes as follows. There are eight classes containing 12 permutations each: 



Here 15|234 denotes the class of all permutations ijklm with = {1,5} and 

{k, I, m} = {2, 3, 4}. Similarly, 45|1|23 denotes the class of all permutations ijklm 
with {i,j} = {4,5}, k = 1, and {l,m} = {2,3}. Clearly, T is a pre-convex rank 
test, as each of the 14 classes is the set of all linear extensions of a poset on [5] = 
{1, 2, 3, 4, 5}. Note that the stabilizer of the pre-convex rank test T in S5 has order 
12, because V is fixed under permutations of {1,5} and permutations of {2,3,4}. 
The 14 classes of T are represented by the 14 vertices of the polytope in Figure 2. 

Each pair of adjacent permutations in a given class of T specifies a CI statement. 
For instance, the four-element class 45|1|23 specifies the two CI statements [4_LL5|0] 
and [2_LL3|145], while the 12-element class 15|234 specifies the seven CI statements 



Altogether, we obtain 44 CI statements [• | •] from the 14 classes, and we identify 
the pre-convex rank test T with this set of 44 CI statements. We now prove: 

Theorem 3.1. T is a coarsest convex rank test which is not submodular. 

Proof. To establish this theorem, we must prove the following three claims: 

• r is a convex rank test, i.e. it satisfies the semigraphoid axioms (SG). 

• There is no proper convex rank test which is coarser than T. 

• The convex rank test T is not submodular. 

We shall prove all three statements at once, by examining the semigraphoid equa- 
tions (SG"). As in Section 2, the 44 symbols in T are denoted with double brackets 
[ • I • ] , while the 36 symbols not in T are denoted with brackets [ • | • ] . With this dis- 
tinction between brackets, there are four symmetry types of semigraphoid equations 
that involve the 36 positive unknowns [ • | • ]. The full list is given in the Appendix: 



15|234 234|15 123|45 235|14 
124|35 245|13 134|25 345|12. 



And there are six classes containing four permutations each: 



12|5|34 25|1|34 13|5|24 
35|1|24 14|5|23 45|1|23. 



[1JL5|0], [2JL3|15], [2JL3|145], [2il_4|15], [2JL4|135], [3JL4|15], [3114|125]. 



Type I 
Type II 
Type III 
Type IV 



[3_U_5|12] + [3_U_4|125] 
[1_U_5|2] + [1113 1 25] 
[4_LL5 1 1] + [2115 [14] 
[1_U_2|5] + [2113 [15] 



[3_U_4|12] + [3_U_5|124] 
[1-113| 21 + [1_U_5|23] 
[2_U_5|1] + [4115 1 12] 
[2_U_3|5l + [1112 1 35] 
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After setting the 44 unknowns [ • | • ] to zero, we are left with 120 equations in the 
36 strictly positive unknowns. For instance, the first three types give 

Type I [3_U_5|12] = [3_LL4|12] 

Type II [1_LL5|2] + [1_LL3|25] = [1_LL5|23] 

Type III [4JL5|1] + [2_IL5|14] = [2iL5|l] + [4U_5|12] 

The axiom (SG") merely requires that each of these equations is individually 
solvable. This is obviously the case. Hence T is a semigraphoid. 

The 78 equations of Type I listed in the Appendix imply that all 36 positive 
unknowns must be equal. So, if another CI statement is added to the semigraphoid 
T, then all others must be added in order for (SG) to remain valid. This proves our 
second claim that T is a coarsest convex rank test. 

Given that the 36 unknowns [ • | • ] must be equal, the 12 Type II equations imply 
that their common value is zero, contradicting the requirement that they be positive. 
Hence the 120 orginal equations altogether have no non-negative real solution that 
is consistent with V. This proves our third claim that T is not submodular. □ 

Every semigraphoid for n = 5 corresponds to a 4-dimensional fan. Intersecting 
this fan with a sphere around the origin, we obtain a polyhedral cell decomposition 
of the 3-dimensional sphere. We do not know whether each of these 3-spheres can 
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be realized as the boundary of a 4-dimensional polytope. However, using ^3 §5], 
every semigraphoid can be represented by a 3-dimensional diagram as in Figure 2. 

For the specific semigraphoid T of Theorem l3.ll the diagram in Figure 2 is indeed 
the boundary of a 4-polytope with f-vector (14, 36, 32, 10). The following coordinates 
for this polytope were found by a direct calculation, using the techniques described 
in [2]. Each of the following ten row vectors represents a facet of our polytope: 
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20/37 


1/37 


10/37 


-2/37 



For instance, the last row represents the facet-defining inequality 

2 20 1 10 2 

37' X1 + 37' X2 + 37' X3 + 37- X4 " 37 ' * 5 " 

Here, we are considering the vectors (xi,X2,X3,Xi, x$) to be elements in the quotient 
of M 5 modulo the one-dimensional linear subspace spanned by (4,1,1,1,1). Our 
format is that of the software Polymake 7 j. If the above eleven lines are put in a 
file named mypolytope then the following command in Polymake will verify that 
this polytope does indeed have the combinatorial structure displayed in Figure 2: 

polymake mypolytope F_VECT0R VERTICES_IN_FACETS 

The 10 facets of our 4-polytope correspond to the facets of the 5-cube, and they 
comprise all classes of permutations in S5 in which the first or last coordinate is 
fixed. The facets corresponding to permutations with 1 or 5 in the first coordinate 
have seven vertices, twelve edges, and eight 2-faces. The facets corresponding to 
permutations with 2, 3 or 4 first have seven vertices, 13 edges, and eight 2-faces. 
The facets for 1 or 5 last are tetrahedra. The facets for 2, 3 or 4 last are cubes in 
which one edge has been contracted; they have seven vertices and 11 edges. 



4 The semigraphoid semigroup is not normal 

Continuing to assume n = 5, we now consider the linear map A in the Introduction. 
It maps the free abelian group Z 80 spanned by the CI statements to the free abelian 
group Z 32 with basis {ex '■ K C [5]} as specified in (^Q). The matrix representing A 
has 32 rows and 80 columns; each column has four non-zero entries: two +l's and 
two — l's. The rank of A is 26. The semigraphoid semigroup is ^4(N 80 ), the non- 
negative integer span of the columns of this 32 x 80-matrix. This is a subsemigroup 
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of Z 32 . Equivalently, the semigraphoid semigroup is the affine semigroup with 80 
generators and 120 relations (given in the Appendix). Note that the polyhedral 
cone dual to the semigraphoid semigroup is the cone of submodular functions. 

In the language of the vectors in Z 32 are called imsets, the columns of A 
are elementary imsets, and the elements of *4(N 80 ) are combinatorial imsets. A 
structural imset is a lattice point which lies in the polyhedral cone spanned by 
the elementary imsets. Studeny's question (Q2) whether each structural imset is 
combinatorial translates into the question whether the semigroup .4,(N 80 ) is normal. 

Theorem 4.1. The semigraphoid semigroup is not normal for n = 5. 

Proof. Consider the following element in the free abelian group Z 80 : 

[1JL5|2] + [Lil_4|3] + [2JL3|4] + [2_U_3|5] + [3_U_4|12] , , 

+ [2_U_5|13] + [1_LL2|45] + [1_LL3|45] + [4_LL5|23] - [2U_3|45]. { ' 

The image of this element under the map A : Z 80 — » Z 32 is the imset 

b := -e 2 - e 3 - e 4 - e 5 - e 23 + e 24 + 2e 2 5 + 2e 34 + e 35 - e 45 + 2ei 23 ,^ 
+ei 2 4 — ei 2 5 — ei34 + e\^ + 2ei45 — ei 2 34 — ei 2 35 — ei 2 45 — 61345. 

The imset b is structural because 2 • b is a combinatorial imset. It is the image of 

[4U_5|2] + [4_U_5|3] + [LLL3|4] + [LU_2[5] + [2iL5|14] + [3_LL4|15] 

+[1JL4|23] + [1_LL5|23] + [1_LL5|2] + [1_LL4|3] + [2_LL3|4] (5) 
+ [2_U_3|5] + [3_LL4|12] + [2_LL5|13] + [1_LL2|45] + [1_U_3|45] € N 80 

under the linear map A. 

Suppose that b were a combinatorial imset. Then there exists x G N 80 such 
that A ■ x = b. We write x = ^Vaj JJ_6j|i?j], where we allow repetition in the 
sum. In any elementary imset, the basis vector e$ occurs with coefficient — 1 or 0, 
and the basis vector ei 2 345 occurs with coefficient —1 or 0. However, neither e$ nor 
ei 2 345 appears in the imset b, so we conclude that \K{ \ = 1 or \Ki\ = 2 for all terms 
[ai_LL6ijjFQ] in the representation of x. The first four terms — e 2 — e^ — — e§ in 
b imply that x has precisely four terms [ai_LL&i|.fQ] with \K{\ = 1, and the terms 
— ei 2 34 — ei 2 35 — ei 2 45 — 61345 imply that x has precisely four terms with \Ki\ = 2. 

Each of the eight terms in x evaluates to an alternating sum of 4 terms under 
the map A. Some cancellation occurs among the resulting 32 terms. Prior to that 
cancellation, our imset had been written as the sum of two subsums, b = A ■ x = 

-e 2 - e 3 - e 4 - e 5 + e 24 + 2e 25 + 2e 34 + e 35 + ba 1 + e.A 2 - ei 2 5 - ei 34 - e Bl - e B2 

-e23-e45-eA 1 -eA 2 + 2e i23+ei24+ei35+2e 1 4 5 +e Bl +eB 2 -ei234-ei235-ei245-ei345, 

where |j4i| = |A 2 | = 2 and \B\\ = |5 2 | = 3. The first line is the sum of the four 
elementary imsets «4([aj_LL6j|ii'i]) with \Ki\ = 1, and the second line is the sum of 
the four elementary imsets with \K{\ = 2. A contradiction will arise when we try 
to determine the unknown pairs A\ and A%. The term — ei 2 5 in the first line must 
come from Ki = {2} or Ki = {5}. This implies that either {1,2} or {1,5} is in 
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A* = {Ai, A2}. Similarly, the term — ei34 shows that either {1,3} or {1,4} is in A*. 
Now consider the second line. The presence of the term 2ei23 implies that {1, 2} or 
{1,3} is in A*, and the term 2ei45 implies that {1,4} or {1,5} is in A*. The term 
ei24 shows that {1,2}, {1,4}, or {2,4} is in A*, and, finally, the term ei35 shows 
that {1,3}, {1,5}, or {3,5} is in A*. However, no such pair of pairs A* satisfies 
these six restrictions. This proves that b is not a combinatorial imset. □ 

The main point of the above proof was to show that the linear system A-x = b 
has no solution with non- negative integer coordinates. This can also be verified 
automatically using integer programming software. In fact, using such software we 
found that A • x = b has only one solution with non- negative real coordinates, 
namely, that unique solution x € (M>o) 80 is the expression in © scaled by 1/2. 

The reader might now inquire how the imset b was found. There are several 
algorithms that test whether a given affine semigroup is normal, including one re- 
cently proposed by Takemura, Yoshida and the first author and the method of 
Bruns and Koch [3| which is implemented in their software normaliz. 

Our original attempts to apply these methods directly to the 32 x 80-matrix A 
were unsuccessful. Instead we succeeded by partially computing a Markov basis for 
the matrix A using the software 4ti2 18^ The imset b was found by inspecting the 
partial results produced by 4ti2. We explain the details in the next section. 



5 Computations in toric algebra 

Let Q[CI n ] denote the polynomial ring over the field of rational numbers Q generated 
by the symbols [i-LLj \K]. Thus Q[CI n ] is a polynomial ring in j n unknowns, one for 
each 2-face of the n-cube C n . We write \\ CI n for the product of all the unknowns. 
We define the semigraphoid ideal to be the ideal Isg generated by the binomials in 
(SG'"). Thus the generators of Isg represent the semigraphoid axioms. Following 
|14| §7], we introduce the toric ideal I a which is obtained from Isg by saturation: 

I A ■= {Isg : (n ci -)°°)- ( 6 ) 

The binomials in I4 represent the vectors in the kernel of the linear map A : Z 7n — > 
Z 2 " . A minimal set of binomials which generates I4 is said to be a Markov basis for 
the matrix A. See [I] for a discussion of Markov bases in the context of statistics. 

Let us illustrate these concepts for n = 3. The polynomial ring QfCLj] has six 
unknowns, one for each facet of the 3-cube. They are the entries of the 2 x 3-matrix 

/[1_IL2|0] [LIL3|0] [2Ji_3|0]\ 

V[1_LL2|3] [1_LL3|2] [2113|1]/ ' 1 ' 

The semigraphoid ideal Isg is generated by the three 2 x 2-minors of the matrix (|7j). 
This is a prime ideal of codimension 2 and degree 3, and hence we have Isg = I A- 
Here the Markov basis for A consists precisely of the three semigraphoid axioms. 

We next consider the case n = 4. The polynomial ring QfCLi] has 24 unknowns, 
one for each 2-face of the 4-cube. They are the entries of eight 2 x 3-matrices as in 
(J7|, one for each of the eight facets of the 4-cube. Thus the semigraphoid ideal Isg 
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is generated by 24 quadrics, one for each of the 24 axioms (SG") in the list given 
in Section 2. For instance, the last axiom in that list translates into the quadratic 
binomial [1_LL2|4] • [2_LL3|14] - [2_LL3|4] • [11L2[34], which is one of the 24 generators 
°f ^SG- Using the software Macaulay2 (B| we derived the following result: 

Proposition 5.1. The semigraphoid ideal Isg is a radical ideal which is the inter- 
section of the toric ideal Ia and 17 additional associated monomial prime ideals. 

Before discussing this prime decomposition in detail, let us make a few general 
remarks. We wish to argue that toric algebra and algebraic geometry provide useful 
algorithmic tools for the research directions presented in |18j . For any ideal I of 
Q[CI n ] and any subset Q of the complex affine space C 7n , the variety Vq(I) is 
defined as the set of all vectors in Q which are common zeros of all the polynomials 
in /. Then Vc(Isg) is a complex variety, reducible for n > 4, one of whose irreducible 
components is the complex toric variety Vc(Ia). Inside this toric variety are the real 
toric variety Vs.(Ia)- Its non-negative part Vr >0 (Ia) is homeomorphic to the cone 
spanned by the elementary imsets. Our next result shows that the semigraphoids 
are precisely the points on these varieties whose coordinates are or 1. 

Theorem 5.2. The semigraphoids on [n] are in bisection with the points in V|o,i}(-^Sg)- 
The submodular semigraphoids on [n] are in bijection with the points in V^q^^Ia)- 

Proof. We replace the additive semiring {0, +} with the multiplicative semiring 
{1,0}. This translates from the additive notation (SG") to the multiplicative no- 
tation (SG'"). With this translation, the first statement in Theorem 15 .21 is obvious. 

The second statement is less obvious and is based on the geometry of toric 
varieties. Specifically, we shall use the characterization of facial index sets which is 
developed in ..Si. If we consider our specific 2 n x 7 n -matrix A then the role of the set 
{1, . . . , m} in [3] is played by the set of CI statements, and a subset of CI statements 
is facial for A if and only if it is submodular semigraphoid. With this observation, 
our second assertion follows from Lemma A. 2 in the Appendix of □ 

Using Theorem l5.21 we can study semigraphoids by studying the zero-dimensional 
ideals obtained by adding ( x 2 — x : x 6 CI n ) to the ideal 7sg or I a- For instance, 
with the command degree in Macaulay2 6l, it takes only a few seconds to compute 

#U {0 ,i } (/sg) = 26424 and W{o,i}{Ia) = 22108. (8) 

The difference between these numbers is explained geometrically by the prime de- 
composition in Proposition 15. H which we shall now describe in explicit terms. 

The 17 associated monomial primes of Isg come in three symmetry classes. 
First there are two primes of codimension 12. A representative is the ideal 

/ [1J12|0], [1J1_3[0], [1_IL4|0], [2iL3|0], [2_LL4|0], [3_LL4|0], \ 
\ [3_LL4|12], [2_U_4|13], [2_LL3|14], [1_LL4|23], [1_LL3|24], [1_U_2|34] / ' 

The semigraphoid ideal Isg has 12 associated primes of codimension 15, such as 

/ [1IL2|0], [1J13I0], [1_LL4|0], [3JL4|0], [1JJ_3|2], [1_LL4[2], [3JL4|2], [1J_L2|3],\ 
\ [2iL4|3], [1_LL2|4], [2_LL3|4], [3_U_4|12], [2_U_4|13], [2_U_3|14], [1_LL2|34] / ' 
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Next, Isg has three associated primes of codimension 16. A representative is 

/ [1_IL2|0], [1JL3|0], [2_LL4|0], [3_LL4|0], [21L4|1], [3JL4|1], [1_U_3|2], [3JL4|2], \ 
\ [1_LL2[3], [2_LL4|3], [1_LL2|4], [1_U_3|4], [3_U_4|12], [2_LL4|13], [1_LL3|24], [1U_2[34]/ ' 

Each of the 4316 non-submodular semigraphoids is a {0, l}-valued point not in 
V(IjO but in one of the 17 coordinate subspaces corresponding to these primes. 

Finally, the last associated prime of 7sg is the toric ideal I4. This ideal has 
codimension 13 and degree 396. Its minimal generating set consists of 52 binomials. 
Besides the 24 quadrics (the axioms), the Markov basis of A contains four cubics 

[2_U_3|1] • [3_LL4|2] • [1_U_3|4] - [3_LL4|1] • [1_U_3|2] • [2_LL3|4], 
[2_U_3|1] • [2_LL4|3] • [1_LL2]4] - [2_LL4|1] • [1_LL2]3] • [2_LL3|4], 
[1_U_3|2] • [1_U_4|3] • [1_LL2|4] - [1_U_4|2] • [1_U_2|3] • [1_LL3|4], 
[2_LL4|1] • [3_LL4|2] • [1_U_4|3] - [3_LL4|1] • [1_LL4|2] • [2_LL4|3], 

and 24 quartics such as 

[1_LL2[0] • [3_LL4|0] • [2_LL4|13] • [1_LL3[24] - [1_U_3|0] • [2_LL4|0] • [3J14|12] • [1_U_2|34]. 

We now come to case n = 5. It will be a challenge for future commutative algebra 
software to compute a primary decomposition of the semigraphoid ideal Isg for 
n = 5. At present we do not know even whether Isg is radical. Let us therefore 
focus on the main component of this ideal, namely, the toric ideal I a- Here our 
main goal is to compute its minimal generators, that is, the Markov basis of A. We 
attacked this problem using the software 4ti2 [Sj, and we now discuss the results. 

First, we started a Markov basis computation for the toric ideal I4 using the 
function markov of 4ti2, but this computation turned out to be non-trivial. In 
the hope that a counterexample would not involve all 80 variables, we set several 
variables to and tried to compute the Markov basis of smaller ideals that are con- 
tained in I4. For the one-day computation that finally produced a counterexample, 
we set the first 18 formal symbols to zero and found the Markov basis move 

g := (a + 2- [2_LL3|45] ) - {(3 + 2- [41L5|23]) eN 80 , where 

a = [4_L5|2] + [4U_5|3] + [1_IL3|4] + [1_LL2|5] + [2_U_5|14] + [3JL4|15] + [1_LL4|23] + [1_LL5|23], 
(3 = [1_LL5|2] + [1_LL4|3] + [2_LL3|4] + [2_LL3|5] + [3_LL4|12] + [2_LL5|13] + [1_LL2|45] + [1_LL3|45]. 

This lattice vector corresponds to a binomial x g+ — x s which is in the toric 
ideal I a and has the property that both of its monomials are not square-free. We 
then verified that x g+ — x g is not only indispensable for the smaller ideal (with 
18 variables set to zero) but also indispensable for I4. Recall (e.g. from pQ) that a 
binomial x g — x g in the toric ideal 1 a is called indispensable if 

{zG N 80 :^-z = ^-g+} = {g+,g-}. 

This means that the Markov move g corresponds to a 2-element fiber given by the 
right-hand side and consequently, g must belong to every Markov basis of I ^4. In 
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order to check this condition for our given move g, we computed the minimal Hilbert 
basis (that is, the <-minimal integer solutions) of the cone 

{(z, u) G R 81 : A ■ z - (A ■ g + ) • u = 0, (z, u) > 0}. 

This was done using the function hilbert of 4ti2 which produced precisely the 
two expected elements (g + , 1) and (g~, 1) within a few seconds. 

From our special Markov move g = (a + 2- [2_LL3|45]) — (f3+2- [4_LL5|23]), we then 
constructed the imset b presented in Section 4. We first checked that b was not 
a combinatorial imset by showing that Ax = b has no solutions with non-negative 
integer coordinates. Using the functions hilbert and rays of the program 4ti2, 
we computed the Hilbert basis and the extreme rays of the cone 

{(z,u)E M 81 : A- z = b • u and (z, u) > 0}. 

Both computations quickly finished. They showed that this cone has dimension one 
and is generated by the single vector (a+/3, 2). Consequently, the only non-negative 
real solution to A • x = b is (a + /3)/2, which is not an integer solution. 

We are currently in the process of computing the complete minimal Markov basis 
of the toric ideal for semigraphoids with n = 5. That Markov basis has well over a 
million elements. Yet, we are convinced that 4ti2 will succeed. The completion of 
that Markov basis will represent a computational milestone in toric algebra. 
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6 Appendix: The 120 semigraphoid axioms 

Here is the list of all 120 semigraphoid axiom for n = 5, grouped into triples accord- 
ing to which 3-face of the 5-cube they come from. The two types of brackets specify 
the non-submodular coarsest semigraphoid T which was discussed in Section yj 
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